Protein synthesis is the process by which cells build proteins based on the instructions carried in DNA. It involves two main steps: transcription and translation.
General Procedural Steps - Central DOGMA:
- Transcription (DNA to mRNA)
- mRNA processing (Splicing and adding tails or caps)
- Translation (mRNA to protein)
- Protein folding
Transcription (DNA to mRNA)
Where: In the nucleus (for eukaryotes).
General Process: One DNA strand acts as the template for the mRNA copy in which uracil is used instead of thymine.
Result: A strand of mRNA that carries the code for a protein out of the nucleus to the ribosome.
Initiation:
- RNA polymerase is the main enzyme involved in translation, as it binds to the promoter. The promoter is located BEFORE the gene itself, as a binding site for RNA polymerase. It is simply a specific DNA sequence, like the TATA box typically found in eukaryotic cells.
- When RNA polymerase binds, it separates the DNA strands, only focusing on one template strand.
Elongation:
- RNA polymerase is able to read the template stand (also known as non-coding or antisense) and build an RNA molecule that is complementary to the template stand.
- Remember that the coding strand (or sense strand) is the one that wasn’t used to generate the complementary strand, but it is nearly identical to the new complementary strand (only thymine is replaced with uracil, because it is now an RNA strand).
- Note that RNA polymerase READS the strand from 3’ to 5’ but the strand always GROWS from 5’ to 3’.
Termination
- Another sequence, called a terminator, signals that RNA transcription is finished.
- The transcript is finally released from the RNA polymerase.
mRNA Processing
Modifications to the ends of mRNA:
- Before processing, the mRNA is more specifically called a pre-mRNA.
- The ends of pre-mRNA’s are changed, a 5’ cap is added at the beginning (where the phosphate group on the molecule’s fifth carbon) and a 3’ poly-A tail is added at the end (where the hydroxyl group of the sugar molecule is).
- Note that the 5’ cap is a modified guanine nucleotide, designed to protect from degradation and facilitate translation.
- Note that the 3’ cap is made up of repeated adenine nucleotides, and only eukaryotic cells have them. It protects the DNA from degradation by exonucleases.
Splicing
- Introns are parts of the mRNA sequence that do not code for proteins.
- Exons are parts of the mRNA sequence that contain important information for the translation to proteins.
- During mRNA splicing, introns are spliced out by spliceosomes.
- Exons are reconnected.
- You may be wondering: why do introns even exist if they are going to be spliced out anyways? That’s because they can allow for alternative splicing, where different combinations of exons can occur, leading to different proteins to be made from the same gene. Introns can also affect gene regulation.
Transcription in eukaryotes vs prokaryotes:
- Transcription is controlled individually for each gene in the nucleus for eukaryotes, but in the cytoplasm for prokaryotes.
- Prokaryotes use a single RNA polymerase to transcribe ALL genes
- Eukaryotes use multiple RNA polymerases.
- Transcription and translation can occur simultaneously in the cytoplasm of prokaryotes.
- Transcription and translation are spatially separated by the nuclear membrane.
Translation (mRNA to Protein)
Where: In the ribosome (in cytoplasm).
Key Players: mRNA (messenger), ribosome (the reader and builder), tRNA (carrier of amino acids), amino acids (protein building blocks)
- The mRNA is now going to be read in groups of three nucleotides, called codons.
- AUG is a specific codon which forms the amino acid methionine, acting as a START CODON to signal the start of protein building.
- The following three STOP codons DO NOT specify amino acids: UAA, UAG, and UGA. They alert the cell when a polypeptide is complete.
Transfer RNAs (tRNAs)
- Each tRNA molecule carries a specific amino acid.
- tRNA has an anticodon, a set of 3 bases that are complementary to the mRNA codon.
- Process:
- The tRNA matches its anticodon to the codon on the mRNA.
- The ribosome links the carried amino acid of tRNA to the growing protein chain.
- The ribosome moves along the mRNA, reading codons and bringing the correct tRNAs one by one.
Ribosomes
- This is where the actual proteins are built.
- They are made up of proteins and ribosomal RNA (rRNA).
- It has a large and a small subunit - coming together to wrap around the mRNA.
Translation steps
- Inititation: The initiation complex is set up → ribosome assembles around the mRNA and the first tRNA
- Elongation: Each codon is read and a matching amino acids is added the growing polypeptide chain
- Termination: When a stop codon is reached, the polypeptide chain is released from the ribosome
Codons and Reading Frame
- Codon: A group of 3 mRNA bases (nucleotides) that codes for one amino acid.
- The ribosome reads the mRNA 3 nucleotides at a time (so the 5th amino acid in the sequence is coded by the 5th codon).
- Examples of codons:
- AUG = Start codon (also codes for methionine)
- UUU = Phenylalanine
- GGC = Glycine
- There are specific codons for each amino acid, and the matching is universal across most living organisms.
How Protein Shape Depends on Sequence:
Protein structure levels:
- Primary structure: Sequence of amino acids.
- Secondary structure: Folding into alpha helices or beta sheets.
- Tertiary structure: Overall 3D shape due to side chain interactions.
- Quaternary structure (for some proteins): Arrangement of multiple polypeptide chains.
The order of amino acids directly influences how the protein folds and its final shape.
A protein’s shape determines its function (e.g., enzymes, hormones, antibodies).
Quick Summary Table
Step |
Location |
Main Idea |
Transcription |
Nucleus |
DNA to mRNA |
Translation |
Ribosome (cytoplasm) |
mRNA to Amino acid chain |
Codon Reading |
Ribosome |
Reads 3 bases (codon) at a time |
tRNA Role |
Cytplasm |
Carries specific amino acid, matches codon |
Protein Folding |
Endoplasmic Reticulum |
Sequence of amino acids decides final 3D shape |
Fun Facts!
- The genetic code is universal: same codons code for the same amino acids in almost all organisms!
- One mistake in sequence (mutation) can change a protein’s shape and cause diseases (like sickle-cell anemia).
Written by Kasiban Parthipan